Adjacent Nucleotide Dependence in ncRNA and Order-1 SCFG for ncRNA Identification

نویسندگان

  • Thomas K. F. Wong
  • Tak-Wah Lam
  • Wing-Kin Sung
  • Siu-Ming Yiu
چکیده

BACKGROUND Non-coding RNAs (ncRNAs) are known to be involved in many critical biological processes, and identification of ncRNAs is an important task in biological research. A popular software, Infernal, is the most successful prediction tool and exhibits high sensitivity. The application of Infernal has been mainly focused on small suspected regions. We tried to apply Infernal on a chromosome level; the results have high sensitivity, yet contain many false positives. Further enhancing Infernal for chromosome level or genome wide study is desirable. METHODOLOGY Based on the conjecture that adjacent nucleotide dependence affects the stability of the secondary structure of an ncRNA, we first conduct a systematic study on human ncRNAs and find that adjacent nucleotide dependence in human ncRNA should be useful for identifying ncRNAs. We then incorporate this dependence in the SCFG model and develop a new order-1 SCFG model for identifying ncRNAs. CONCLUSIONS With respect to our experiments on human chromosomes, the proposed new model can eliminate more than 50% false positives reported by Infernal while maintaining the same sensitivity. The executable and the source code of programs are freely available at http://i.cs.hku.hk/~kfwong/order1scfg.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Non-coding RNA finding based on probabilistic secondary structure information

Non-coding RNAs (ncRNAs) are under intensive research focus since several years ago. Whereas many researches have done in this field since [1], not so much knowledge about ncRNAs are gained so far. One of the reasons for this is that there is not enough computational tools available for ncRNA analysis. ncRNA finding is one of the most important tools for the analysis. However, no viable solutio...

متن کامل

Grammar string: a novel ncRNA secondary structure representation

Multiple ncRNA alignment has important applications in homologous ncRNA consensus structure derivation, novel ncRNA identification, and known ncRNA classification. As many ncRNAs’ functions are determined by both their sequences and secondary structures, accurate ncRNA alignment algorithms must maximize both sequence and structural similarity simultaneously, incurring high computational cost. F...

متن کامل

Computational identification of noncoding RNAs in E. coli by comparative genomics

Some genes produce noncoding transcripts that function directly as structural, regulatory, or even catalytic RNAs [1, 2]. Unlike protein-coding genes, which can be detected as open reading frames with distinctive statistical biases, noncoding RNA (ncRNA) gene sequences have no obvious inherent statistical biases [3]. Thus, genome sequence analyses reveal novel protein-coding genes, but any nove...

متن کامل

Predicting Non-protein-coding RNA Genes in Escherichia Coli Using SVM with Signature Descriptor

Non-protein-coding RNA (ncRNA) genes are known to play significant roles. Along with transfer RNAs, ribosomal RNAs and mRNAs, ncRNAs contribute to gene splicing, nucleotide modification, protein transport and regulation of gene expression. Several methods exist for predicting ncRNA genes in Escherichia coli (E.coli). In this paper, we describe a very general, highthroughput method for predictin...

متن کامل

Large-scale profiling of noncoding RNA function in yeast

Noncoding RNAs (ncRNAs) are emerging as key regulators of cellular function. We have exploited the recently developed barcoded ncRNA gene deletion strain collections in the yeast Saccharomyces cerevisiae to investigate the numerous ncRNAs in yeast with no known function. The ncRNA deletion collection contains deletions of tRNAs, snoRNAs, snRNAs, stable unannotated transcripts (SUTs), cryptic un...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 5  شماره 

صفحات  -

تاریخ انتشار 2010